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(1) GENERAL INFORMATION: 
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(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Syngenta Patent Dept. 
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(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 
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(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/EP99/ 07972 

(B) FILING DATE: 20-OCT-1999 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Meigs, J. Timothy 

(B) REGISTRATION NUMBER: 38,241 

(C) REFERENCE/DOCKET NUMBER: S-30683A 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 919-541-8587 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 
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(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3A35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ACGTGTCCGT GGAGGCGGGT CGGGTCAGTC GGGTCAGATA CCAAGGTGCC AAGTGGAAGG 60 

riGTGGGAIG GATCTAACCA ATGCAAAAGG TTATTACICG AGACACCGAG TTTGTGGAGT 120 

GCACICTAAA ACACCTAAAG TCACTGTGGC TGGTATCGAA CAGAGGJ.T1T GTCAACAGTG 180 

CAGCAGGITT CATCAGCITC CGGAATITGA CCIAGAGAAA AGGAGTTGCC GCAGGAGACT 240 

CGCTGGTCAT AATGAGCGAC GAAGGAAGCC ACAGCCTGCG TCTCTCTCIG TGTTAGCITC 300 

TCGITACGGG AGGATOGCAC CTTCGCTTTA GGAAAATGGT GATGCIGGAA TGAATQGAAG 360 

Cl'l'l ' C l'lGGG AACCAAGAGA TAGGATGGCC AAGTTCAAGA ACASTGGATA CAAGAGIGAT 420 

GAGGCGGCCA GTGTCATCAC CGTCATGGCA GATCAATCCA ATGAATGTAT TTAGTCAAGG 480 

TTCAGTTGGT GGAGGAAGGA CAAGCTTCTC ATCTCCAGAG ATTATGGACA CTAAACIAGA 540 

GAGCTACAAG G 551 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE OiARACIERISTICS : 

(A) LENGTH: 375 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: 3A35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Glu Met Gly Ser Asn Ser Gly Pro Gly His Gly Pro Gly Gin Ala 
15 10 15 

Glu Ser Gly Gly Ser Ser Thr Glu Ser Ser Ser Phe Ser Gly Gly Leu 
20 25 30 

Met Phe Gly Gin Lys lie Tyr Phe Glu Asp Gly Gly Gly Gly Ser Gly 
35 40 45 

Ser Ser Ser Ser Gly Gly Arg Ser Asn Arg Arg Val Arg Gly Gly Gly 
50 55 60 

Ser Gly Gin Ser Gly Gin He Pro Arg Cys Gin Val Glu Gly Cys Gly 
65 70 75 80 

Met Asp Leu Thr Asn Ala Lys Gly Tyr Tyr Ser Arg His Arg Val Cys 
85 90 95 

Gly Val His Ser Lys Thr Pro Lys Val Thr Val Ala Gly He Glu Gin 
100 105 110 

Arg Phe Cys Gin Gin Cys Ser Arg Phe His Gin Leu Pro Glu Phe Asp 
115 120 125 

Leu Glu Lys Arg Ser Cys Arg Arg Arg Leu Ala Gly His Asn Glu Arg 
130 135 140 

Arg Arg Lys Pro Gin Pro Ala Ser Leu Ser Val Leu Ala Ser Arg Tyr 
145 150 155 160 

Gly Arg He Ala Pro Ser Leu Tyr Glu Asn Gly Asp Ala Gly Met Asn 
165 170 175 

Gly Ser Phe Leu Gly Asn Gin Glu He Gly Trp Pro Ser Ser Arg Thr 
180 185 190 

Leu Asp Thr Arg Val Met Arg Arg Pro Val Ser Ser Pro Ser Trp Gin 
195 200 205 

He Asn Pro Met Asn Val Phe Ser Gin Gly Ser Val Gly Gly Gly Arg 
210 215 220 



Thr Ser Phe Ser Ser Pro Glu He Met Asp Thr Lys Leu Glu Ser Tyr 
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225 230 235 240 

Lys Gly lie Gly Asp Ser Asn Cys Ala Leu Ser Leu Leu Ser Asn Pro 
245 250 255 

His Gin Pro His Asp Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn 
260 265 270 

Asn Asn Asn Thr Trp Arg Ala Ser Ser Gly Phe Gly Pro Met Thr Val 
275 280 285 

Thr Met Ala Gin Pro Pro Pro Ala Pro Ser Gin His Gin Tyr Leu Asn 
290 295 300 

Pro Pro Trp Val Phe Lys Asp Asn Asp Asn Asp Met Ser Pro Val Leu 
305 310 315 320 

Asn Leu Gly Arg Tyr Thr Glu Pro Asp Asn Cys Gin He Ser Ser Gly 
325 330 335 

Thr Ala Met Gly Glu Phe Glu Leu Ser Asp His His His Gin Ser Arg 
340 345 350 

Arg Gin Tyr Met Glu Asp Glu Asn Thr Arg Ala Tyr Asp Ser Ser Ser 
355 360 365 

His His Thr Asn Trp Ser Leu 
370 375 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 859 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3B39 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TCAACATTGC TTCCTAACCA GAAATCCACC ATCATCITCC CACGAAXACA ACITAAAGCT 60 

TTACCAGAAA ATGGAGGGTC AGAGAACACA AOGCCGGGCT TACTTGAAAG ACAAGGCTAC 120 

AGTCTCCAAC CITGTTGAAG AAGAAATGGA GAATGGCATG GATGGAGAAG AGGAGGATGG 180 

AGGAGACGAA GACAAAAGGA AGAAGGTGAT GGAAAGAGTT AGAGGTCCTA GCACTGACCG 240 

TGITCCATCG CGACTGTGCC AGGTCGATAG GTGCACTGIT AATTTGACIG AGGCCAAGCA 300 

GTATTACCGC AGACACAGAG TATGTGAAGT ACATGCAAAG GCATCTGCTG CGACTGITGC 360 

AGGGGTCAGG CAACGCTiTT GTCAACAATG CAGCAGGTIT CATGAGCTAC CAGAGTITGA 420 

TGAAGCTAAA AGAAGCTGCA GGAGGCGCTT AGCIGGACAC AATGAGAGGA GGAGGAAGAT 480 

CTCTGGTGAC AGITITGGAG AAGGCTCAGG CCGGAGAGGG TTTAGCGGTC AACTGATCCA 540 

GACTCAAGAA AGAAACAGGG TAGACAGGAA ACTTCCTATG ACCAACTCAT CATITAAGGG 600 

ACCACAGATC AGATAAACCC TCCCGCTCTC TCTCITCrGT CATCTACATA TGCTCIATCT 660 

ACACTCITAT TAGACAAATA ATGGCATCIA ACAATGTCAA GAAAAGITGG TCATGGTATT 720 

AAATCCTAGA GGGAAATATA AGTATAAACC TITAGTCCCC TITATGCTGT CCTGTAATGA 780 

ATATCTATCC GGAAATCTAT TCX3CATACTC TTGCCTCTAA TAATGTTTAT TAAAAAAAAA 840 

AAAAAAAAAA AAAAAAAAA 859 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

5 



S-30683A 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: 3B39 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Glu Gly Gin Arg Thr Gin Arg Arg Gly Tyr Leu Lys Asp Lys Ala 
15 10 15 

Thr Val Ser Asn Leu Val Glu Glu Glu Met Glu Asn Gly Met Asp Gly 
20 25 30 

Glu Glu Glu Asp Gly Gly Asp Glu Asp Lys Arg Lys Lys Val Met Glu 
35 40 45 

Arg Val Arg Gly Pro Ser Thr Asp Arg Val Pro Ser Arg Leu Cys Gin 
50 55 60 

Val Asp Arg Cys Thr Val Asn Leu Thr Glu Ala Lys Gin Tyr Tyr Arg 
65 70 75 80 

Arg His Arg Val Cys Glu Val His Ala Lys Ala Ser Ala Ala Thr Val 
85 90 95 

Ala Gly Val Arg Gin Arg Phe Cys Gin Gin Cys Ser Arg Phe His Glu 
100 105 110 

Leu Pro Glu Phe Asp Glu Ala Lys Arg Ser Cys Arg Arg Arg Leu Ala 
115 120 125 

Gly His Asn Glu Arg Arg Arg Lys He Ser Gly Asp Ser Phe Gly Glu 
130 135 140 

Gly Ser Gly Arg Arg Gly Phe Ser Gly Gin Leu He Gin Thr Gin Glu 
145 150 155 160 

Arg Asn Arg Val Asp Arg Lys Leu Pro Met Thr Asn Ser Ser Phe Lys 
165 170 175 

Gly Pro Gin He Arg 
180 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to rriRNA 
(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4B19 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

AGAAGCAGAA AGGIAAAGCT ACAAGTAGTA CTGGAGTTTG TCAGGTCGAG AGITGTACCG 60 

CGGATATGAG CAAAGCCAAA CAGTACCACA AACGACACAA AGTCTGCCAG TITCATGCCA 120 

AAGCTCCTCA TGTTCGGATC TCIGGTCITC ACCAACGTIT CTGCCAACAA TGCAGCAGGT 180 

TTCACGCGCT CAGTGAGITT GATGAAGCCA AGOGGAGITG CAGGAGACGC TTAGCIGGAC 240 

ACAACGAGAG AAGGCGGAAA AGCACAACIG ACIAAAGACG GTGAAACGTG TGAGATCCOG 300 

GITTGAAGGT TAATGAAACA GGCITIGCIT ACTCICITCr GTCACTCTCT TITAGCICCT 360 

TGIAATCCTC TGTGTCTCTG TCTGTITCTC CATATTACCT GTAATCAAAG CTATCIGCEA 420 

AACCTACGAC ATQGTTAAAT AAATGCATIG AGACTTAAAA AAAAAAAAAA AAAAAAAAA 479 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: 4B19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ser Met Arg Arg Ser Lys Ala Glu Gly Lys Arg Ser Leu Arg Glu 
15 10 15 

Leu Ser Glu Glu Glu Glu Glu Glu Glu Glu Thr Glu Asp Glu Asp Thr 
20 25 30 

Phe Glu Glu Glu Glu Ala Leu Glu Lys Lys Gin Lys Gly Lys Ala Thr 
35 40 45 

Ser Ser Ser Gly Val Cys Gin Val Glu Ser Cys Thr Ala Asp Met Ser 
50 55 60 

Lys Ala Lys Gin Tyr His Lys Arg His Lys Val Cys Gin Phe His Ala 
65 70 75 80 

Lys Ala Pro His Val Arg lie Ser Gly Leu His Gin Arg Phe Cys Gin 
85 90 95 

Gin Cys Ser Arg Phe His Ala Leu Ser Glu Phe Asp Glu Ala Lys Arg 
100 ■ 105 110 

Ser Cys Arg Arg Arg Leu Ala Gly His Asn Glu Arg Arg Arg Lys Ser 
115 120 125 

Thr Thr Asp 
130 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3A52 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 7: 

GCCATTCAAG GAGACACTAA TGGTGCTCIT ACITTGAATC TTAATGGTGA AAGTGATGGC 60 

CITTTTCCIG CCAAGAAGAC CAAAXCOGGA GCCGTITGTC AGGTCGAAAA CIGTGAAGCT 120 

GATCTTAGTA AAGTTAAGGA TTATCATAGA CGCCATAAGG TCTGTGAGAT GCATTCCAAG 180 

GCTACTAGTG CCACICTCGG AGGTATCITG CAGCGCITTT GTCAGCAATG TAGTAGGITC 240 

CATCTTCIGC CAGGTTTCGA TGACGGAAAG AGAAGTIGTC CTAGACGITT GGCTGGCCAT 300 

AATAAACCTC CGAGGAAAAC AAATCCOGAA CCTGGCGCTA ACGGGAATCC TAGTCATGAT 360 

CACTCAAGCA ACTATCTCIT GATIACICTC TIGAAGATAC TCTCCAATAT GCATAACCAT 420 

ACCGGTGATC AAGATTTGAT GTCICATCIT CTGAAGAGCC TCGIAAGCCA TGCIGGOGAA 480 

CAGTTAGGGA AAAACITAGT TGAAC1TCTT CIACAAGGAG AGATCTCAAG GTICCITAAA 540 

ATATTGGAAA ACTCGGCITT GCITGGGATT GAGCAAGCTC CTCAAGAGGA GTTAAAGCAA 600 

TITTCGGCTC GGCAAGATGG GACAGCTACC GAGAACAGAT CAGAAAAACA AGTCAAAATG 660 

AATGAITTTG ATTTGAATGA TATCTATATA GACTCAGATG ACACAGACGT CGAAAGATCT 720 

CCICCTCCAA CGAATCCAGC GACCAGTTCT CTTGATTATC CTTCATGGAT ACATCAGTCT 780 

AGTCCGCCTC AGACAAGTAG GAATTCAGAT TCAGCATCTG ACCAGTCACC CTCAAGITCr 840 

AGTGAAGATG CTCAGATGQG CACAGGCCGG ATTGTGTTCA AACTATTTGG GAAAGAGCCA 900 

AATGAATTTC CTATTGTCIT ACGAGGACAG ATTCTTGACT GGTTATCGCA TAGTCCAACT 960 

GACATGGAGA GCIACATAAG ACCTGGCTGT ATCGTATIGA CCATCIATCT TCGTCAAGCT 1020 

GAAACTGCIT GGGAAGAACT TTCAGACGAT CIGGGITTTA GCTTAGGGAA GCTTCIAGAT 1080 

CTCICCGATG ATCCCITGTG GACAACIGGA TGGATITATG TAGGGTGCAG AACCAACITG 1140 

CATITGTATA TAACGGTCAG GITGTCGITG ACACTTCATT GTCICIAAAA AGTCGTGATT 1200 

ATAGTCACAT CATTAGCGTT AAACCGCITG CTATAGCTGC AAQGGAGAAG GCTCAATITA 1260 
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CAGTTAAAGG CATGAATCTC QGTQGGGGTG GCACAAGGTT ACITIGITCT (3TTGAAGGAA 1320 

AATACITGAT TCAGGAAACA ACACAQGATT CGAQGACCAG GGAGGATGAC (3ATTTCAAGG 1380 

AGAACAGTGA GATTGTTGAG TGTGTAAACT TCTCTTGTGA TATGCCTATA TIGAGTGGTC 1440 

GAGGATTCAT GGAGATTGAA GACCAAGGAC TCAGTAGCAG CTICH'CCCT TTCTIAGTGG 1500 

TTGAAGATGA OGATCTTTGT TCTGAAATCC GTATACITGA AACCACATTA (SAGTTCACTG 1560 

GAACTGATTC TGCTAAGGAA GCTATGGATT TCATACATGA AATGGGTTGG C1TCT1CACA 1620 

GAAGTAAACT TGGGGAATCA GACCCAAATC CAGGCCTTTT CCGATTAATA QGCITCCAGT 1680 

GGCTAATCGA GTTCICAATG GATCGAGAGT GGTGQGCTGT GATCAGAAAG CTATTAAACA 1740 

TGTICnTGA TGGAGCTGTT GGTGAATTIT CTTCCTCCrC TAATGCCACA (ITGTCAGAAC 1800 

TGTGCCITCr TCACAGAGCC GIGAGGAAAA ACTCrAAGCC TATGGTTGAA ATGCTCITGA 1860 

GATATATTCC CAAGCAACAG AGAAACAGCT TGTITAGACC CGATGCTGCT (3CTCCAGCOG 1920 

GCTTAACACC TCTTGATATT GCAGCTGGTA AAGACGGTTC AGAAGATCTG TIGGAIGGGC 1980 

TAACAGAAGA TCCTGCAATG CTGGGGA1TG AAGCGTCGAA GACATGTCGA GACAGCACAG 2040 

GCITGACACC AGAAGACTAC GCACGCTTAC GCGCTCACTT CTCATACATC CACTIGATTC 2100 

AAGGGAAGAT CAATAAAAAG TCAACAACTG AAGATCATGT TCTGGTCAAC ATCCGAGTIT 2160 

CTTTCTCAGA CAGAGAGCAG AAAGAACCAA AATCAGGTCC GATGGCITCA (3CCITGGAGA 2220 

TCACACAGAT TCCATGCAAG CTCTGTGACC ATAAACTGGT GTATGGGACA ACAQGCAGGT 2280 

CTGTAGCCTA CAGACCAGCT ATGTTGTCAA TGGTGGCGAT TGCIGOGGTT ^[GCCTCTGTG 2340 

TGGCACITCr GTTTAAGAGT TGCCGGGAAG TGCTCTATCT GITTCAACCG ^[TCAGGTGGG 2400 

AGTTATTGGA CTATGGAACA AGCIGAGTGT AAGTCIACIT TGAAAGATCT TCTAAGATAT 2460 

ATATATGAAT GTTACITATA TAAAACGATA GAGGTGTGAT TTCTATATGT AACTATATGA 2520 

GTATAAGATA TAGAGACATG TIGGAGAAGA AGATTGTIGT TATTATTCTT GTIGI'ICTTG 2580 

TTCTGTAAAA GCCTCTCCTA TCTCTCTGGA ACCTAAGGAT TCTCTCTCTG ATTAGIATAT 2640 

TITTIGTITG ACAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 2682 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 848 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3A52 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Glu Ala Arg lie Asp Glu Gly Gly Glu Ala Gin Gin Phe Tyr Gly 
15 10 15 

Ser Val Gly Asn Ser Ser Asn Ser Ser Ser Ser Cys Ser Asp Glu Gly 
20 25 30 

Asn Asp Lys Lys Arg Arg Ala Val Ala lie Gin Gly Asp Thr Asn Gly 
35 40 45 

Ala Leu Thr Leu Asn Leu Asn Gly Glu Ser Asp Gly Leu Phe Pro Ala 
50 55 60 

Lys Lys Thr Lys Ser Gly Ala Val Cys Gin Val Glu Asn Cys Glu Ala 
65 70 75 80 

Asp Leu Ser Lys Val Lys Asp Tyr His Arg Arg His Lys Val Cys Glu 
85 90 95 

Met His Ser Lys Ala Thr Ser Ala Thr Val Gly Gly lie Leu Gin Arg 
100 105 110 

Phe Cys Gin Gin Cys Ser Arg Phe His Leu Leu Pro Gly Phe Asp Asp 
115 120 125 

Gly Lys Arg Ser Cys Arg Arg Arg Leu Ala Gly His Asn Lys Arg Pro 
130 135 140 
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Arg Lys Thr Asn Pro Glu Pro Gly Ala Asn Gly Asn Pro Ser Asp Asp 
14 5 150 155 160 

His Ser Ser Asn Tyr Leu Leu He Thr Leu Leu Lys He Leu Ser Asn 
165 170 175 

Met His Asn His Thr Gly Asp Gin Asp Leu Met Ser His Leu Leu Lys 
180 185 190 

Ser Leu Val Ser His Ala Gly Glu Gin Leu Gly Lys Asn Leu Val Glu 
195 200 205 

Leu Leu Leu Gin Gly Arg Arg Ser Gin Gly Ser Leu Asn He Gly Asn 
210 215 220 

Ser Ala Leu Leu Gly He Glu Gin Ala Pro Gin Glu Glu Leu Lys Gin 
225 230 235 240 

Phe Ser Ala Arg Gin Asp Gly Thr Ala Thr Glu Asn Arg Ser Glu Lys 
245 250 " 255 

Gin Val Lys Met Asn Asp Phe Asp Leu Asn Asp He Tyr He Asp Ser 
260 265 270 

Asp Asp Thr Asp Val Glu Arg Ser Pro Pro Pro Thr Asn Pro Ala Thr 
275 280 285 

Ser Ser Leu Asp Tyr Pro Ser Trp He His Gin Ser Ser Pro Pro Gin 
290 295 300 

Thr Ser Arg Asn Ser Asp Ser Ala Ser Asp Gin Ser Pro Ser Ser Ser 
305 310 315 320 

Ser Glu Asp Ala Gin Met Arg Thr Gly Arg He Val Phe Lys Leu Phe 
325 330 335 

Gly Lys Glu Pro Asn Glu Phe Pro He Val Leu Arg Gly Gin He Leu 
340 345 350 

Asp Trp Leu Ser His Ser Pro Thr Asp Met Glu Ser Tyr He Arg Pro 
355 360 365 

Gly Cys He Val Leu Thr He Tyr Leu Arg Gin Ala Glu Thr Ala Trp 
370 375 380 

Glu Glu Leu Ser Asp Asp Leu Gly Phe Ser Leu Gly Lys Leu Leu Asp 
385 390 395 400 
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Leu Ser Asp Asp Pro Leu Trp Thr Thr Gly Trp lie Tyr Val Arg Val 
405 410 415 

Gin Asn Gin Leu Ala Phe Val Tyr Asn Gly Gin Val Val Val Asp Thr 
420 425 430 

Ser Leu Ser Leu Lys Ser Arg Asp Tyr Ser His He He Ser Val Lys 
435 440 445 

Pro Leu Ala He Ala Ala Thr Glu Lys Ala Gin Phe Thr Val Lys Gly 
450 455 460 

Met Asn Leu Arg Arg Arg Gly Thr Arg Leu Leu Cys Ser Val Glu Gly 
465 470 475 480 

Lys Tyr Leu He Gin Glu Thr Thr His Asp Ser Thr Thr Arg Glu Asp 
485 490 495 

Asp Asp Phe Lys Asp Asn Ser Glu He Val Glu Cys Val Asn Phe Ser 
500 505 510 

Cys Asp Met Pro He Leu Ser Gly Arg Gly Phe Met Glu He Glu Asp 
515 520 525 

Gin Gly Leu Ser Ser Ser Phe Phe Pro Phe Leu Val Val Glu Asp Asp 
530 535 540 

Asp Val Cys Ser Glu He Arg He Leu Glu Thr Thr Leu Glu Phe Thr 
545 550 555 560 

Gly Thr Asp Ser Ala Lys Gin Ala Met Asp Phe He His Glu He Gly 
565 570 575 

Trp Leu Leu His Arg Ser Lys Leu Gly Glu Ser Asp Pro Asn Pro Gly 
580 585 590 

Val Phe Pro Leu He Arg Phe Gin Trp Leu He Glu Phe Ser Met Asp 
595 600 605 

Arg Glu Trp Cys Ala Val He Arg Lys Leu Leu Asn Met Phe Phe Asp 
610 615 620 

Gly Ala Val Gly Glu Phe Ser Ser Ser Ser Asn Ala Thr Leu Ser Glu 
625 630 635 640 

Leu Cys Leu Leu His Arg Ala Val Arg Lys Asn Ser Lys Pro Met Val 
645 650 655 



Glu Met Leu Leu Arg Tyr He Pro Lys Gin Gin Arg Asn Ser Leu Phe 
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660 665 670 

Arg Pro Asp Ala Ala Gly Pro Ala Gly Leu Thr Pro Leu His He Ala 
675 680 685 

Ala Gly Lys Asp Gly Ser Glu Asp Val Leu Asp Ala Leu Thr Glu Asp 
690 695 700 

Pro Ala Met Val Gly He Glu Ala Trp Lys Thr Cys Arg Asp Ser Thr 
705 710 715 ~ 720 

Gly Phe Thr Pro Glu Asp Tyr Ala Arg Leu Arg Gly His Phe Ser Tyr 
725 730 735 

He His Leu He Gin Arg Lys He Asn Lys Lys Ser Thr Thr Glu Asp 
740 745 750 

His Val Val Val Asn He Pro Val Ser Phe Ser Asp Arg Glu Gin Lys 
755 760 765 

Glu Pro Lys Ser Gly Pro Met Ala Ser Ala Leu Glu He Thr Gin He 
770 775 780 

Pro Cys Lys Leu Cys Asp His Lys Leu Val Tyr Gly Thr Thr Arg Arg 
785 790 795 800 

Ser Val Ala Tyr Arg Pro Ala Met Leu Ser Met Val Ala He Ala Ala 
805 810 815 

Val Cys Val Cys Val Ala Leu Leu Phe Lys Ser Cys Pro Glu Val Leu 
820 825 830 

Tyr Val Phe Gin Pro Phe Arg Trp Glu Leu Leu Asp Tyr Gly Thr Ser 
835 840 845 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 
<B) CLONE: 4B11 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 9: 

CAGCGGAAGA GCTCACCGTT GAAGAGAGGA ATCTCCTCTC TGTTGCTTAC AAAAAOGTGA 60 

TCGGATCTCT ACGCGCCGCC TGGAGGATOG TGTCTTCGAT TGAGCAGAAG GAAGAGftGTA 120 

GGAAGAACGA CGAGCACGTG TOGCITGTCA AGGATTACAG ATCTAAAGIT GAGTCIGAGC 180 

TTTCITCTGr TTGCTCTGGA ATCCTTAAGC TCCTTGACIC GCATCIGATC CCATCIGCIG 240 

GAGCGAGTGA GTCTAAGGTC 1TITACITGA AGATGAAAGG TGATTATCAT CGGTACATGG 300 

CTGAGITTAA GTCTGGTGAT GAGAGGAAAA CTGCIGCTGA AGATACCATG CTCGCTTACA 360 

AAGCAGCTCA GGATATCGCA GCTGCGGATA TGGCACCTAC TCATCCGATA AGGCTTGGTC 420 

TGGCCCTGAA TTTCTCAGTG ITCTACTATG AGATTCTCAA TTCITCAGAC AAAGCITGTA 480 

ACATGGCCAA ACAGGCITTT GAGGAAGCCA TAGCTGAGCT TGACACTCIG GGAGAAGAAT 540 

CCTACAAAGA CAGCACTCTC ATAATGCAGT T3CTGA 576 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4B11 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Ala Thr Leu Gly Arg Asp Gin Tyr Val Tyr Met. Ala Lys Leu 
15 10 15 

Ala Glu Gin Ala Glu Arg Tyr Glu Glu Met Val Gin Phe Met Glu Gin 
20 25 30 

Leu Val Thr Gly Ala Thr Pro Ala Glu Glu Leu Thr Val Glu Glu Arg 
35 40 45 

Asn Leu Leu Ser Val Ala Tyr Lys Asn Val He Gly Sex- Leu Arg Ala 
50 55 60 

Ala Trp Arg He Val Ser Ser He Glu Gin Lys Glu Glu Ser Arg Lys 
65 70 75 80 

Asn Asp Glu His Val Ser Leu Val Lys Asp Tyr Arg Ser Lys Val Glu 
85 90 95 

Ser Glu Leu Ser Ser Val Cys Ser Gly He Leu Lys Leu Leu Asp Ser 
100 105 110 

His Leu He Pro Ser Ala Gly Ala Ser Glu Ser Lys Val Phe Tyr Leu 
115 120 125 

Lys Met Lys Gly Asp Tyr His Arg Tyr Met Ala Glu Phe Lys Ser Gly 
130 135 140 

Asp Glu Arg Lys Thr Ala Ala Glu Asp Thr Met Leu Ala Tyr Lys Ala 
145 150 155 160 

Ala Gin Asp He Ala Ala Ala Asp Met Ala Pro Thr His Pro He Arg 
165 170 175 

Leu Gly Leu Ala Leu Asn Phe Ser Val Phe Tyr Tyr Glu He Leu Asn 
180 185 190 

Ser Ser Asp Lys Ala Cys Asn Met Ala Lys Gin Ala Phe Glu Glu Ala 
195 200 205 

He Ala Glu Leu Asp Thr Leu Gly Glu Glu Ser Tyr Lys Asp Ser Thr 

210 215 220 

Leu He Met Gin Leu Leu Arg Asp Asn Leu Thr Leu Trp Thr Ser Asp 
225 230 235 240 
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Met Gin Glu Gin Met Asp Glu Ala 
245 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 659 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4A24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CGCCGCCACC GOGATGTAOG TGATCTACCA CCCTCGTCCG CCGrOGTTCT CCGTCCCGTC 60 

AATAAGAATC AGCCGCGIGA ACCTAACAAC CTCCTCTGAT TCCTCQGTCT CTCATCTCTC 120 

TTCCTTCTTC AACFTCACTC TAATCTCAGA GAATCCAAAC CAACACCTCT CnTCTCITA 180 

CGATCCITTC ACOGTCACCG TTAATTCAGC TAAATCCGGI ACGATGCTCG GTAACOGAAC 240 

TGITCCTGCr TICITCAGCG ATAACGGTAA GAAAACTTCG TTTCACGGCG TGATCGCTAC 300 

GICTACAGCG GCGCGTGAGT TAGATCOGGA TGAAGCTAAG CATCTGAGAT CAGATCIGAC 360 

GCGQGCGCGT GTAGGATATG AGATCGAGAT GAGAACTAAA GTGAAGATGA TAATGGGGAA 420 

GCIGAAGAGT GAAGGAGTAG AGATCAAAGT GACATGTTGA AGGATTTGAA GGAACTATAC 480 

CAAAAGGTAA AACTCCAATT GTAGCTACTT CTAAAAAAAC TAAGTGTAAG TCTGATCTTA 540 

GIGTCAAGTC TQGAAATGGA TTTCTAAAGG AATTTGATAA TTTCACATTG AAATTCIATA 600 

TATCTCICTT TTTCTCTGGA TTTGTGAAAC TTTGGATGAT CAAAGAATTC TTCATTGTC 659 
(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4A24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Arg He Cys Cys Cys Cys Phe Trp Ser He Leu He He Leu He Leu 
15 10 15 

Ala Leu Met Thr Ala He Ala Ala Thr Ala Met Tyr Val He Tyr His 
20 25 30 

Pro Arg Pro Pro Ser Phe Ser Val Pro Ser He Arg He Ser Arg Val 
35 40 45 

Asn Leu Thr Thr Ser Ser Asp Ser Ser Val Ser His Leu Ser Ser Phe 
50 55 60 

Phe Asn Phe Thr Leu He Ser Glu Asn Pro Asn Gin His Leu Ser Phe 
65 70 75 80 

Ser Tyr Asp Pro Phe Thr Val Thr Val Asn Ser Ala Lys Ser Gly Thr 
85 90 95 

Met Leu Gly Asn Gly Thr Val Pro Ala Phe Phe Ser Asp Asn Gly Asn 
100 105 110 

Lys Thr Ser Phe His Gly Val He Ala Thr Ser Thr Ala Ala Arg Glu 
115 120 125 

Leu Asp Pro Asp Glu Ala Lys His Leu Arg Ser Asp Leu Thr Arg Ala 
130 135 140 
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Arg Val Gly Tyr Glu He Glu Met Arg Thr Lys Val Lys Met He Met 
145 150 155 160 

Gly Lys Leu Lys Ser Glu Gly Val Glu He Lys Val Thr Cys 
165 170 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 584 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3B76 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CCTCCAACTC CAGGCCAGCC AACAAAAGAA CCTACATTTA TTCCAGTGGT TGTTGGTCIT 60 

TTGGACTCAA GTGGGAAAGA CATTACTCTT TCCTCTGITC ATTATGATGG TACAGTGCAG 120 

ACCATTTCAG GCAGCAGCAC AATACTTCGA GTGACAAGAA ACAAGAAGAG TITGIGTITT 180 

CTGATATACC AGAAAGACCT GTTCCGTCCC TAITTAGGGG ATTCAGCCCC AGITCGIGTT 240 

GAAACTGATC TCTCTAATGA TGACITATTC TTCCTCCTAG CACATGATTC A3ATGAA1TC 300 

AATAGGTQGG AGGCOGGTCA AGTTCTGGCA AGAAAGCTGA TGCTGAACTT AGTTTCIGAT 360 

TTCCAGCAAA ATAAACCGTT GGCTCTAAAC CCAAAATTTG TGCAAGGTCT G3GCAGTGTG 420 

CITTCTGACT CAAGCTTGGA CAAGGAATTT ATAGCCAAAG CAATAACACT ACCTGGGGAG 480 

GGAGAGATAA TGGACATGAT GGCCGTGGOG GATCCTGATG CTGTTCATGC TCTTAGAAAG 540 

TTTGTACGAA AGCAGCTTGC ATCIGAACIT AAGGAGGAGC TTCT 584 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 amino acids 

(B) TYPE: amino acid 

(C) STRAMDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3B76 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Pro Pro Thr Pro Gly Gin Pro Thr Lys Glu Pro Thr Phe lie Pro Val 
15 10 15 

Val Val Gly Leu Leu Asp Ser Ser Gly Lys Asp lie Thr Leu Ser Ser 
20 25 30 

Val His Tyr Asp Gly Thr Val Gin Thr He Thr Gly Ser Ser Thr He 
35 40 45 

Leu Arg Val Thr Lys Lys Gin Glu Glu Phe Val Phe Ser Asp He Pro 
50 55 60 

Glu Arg Pro Val Pro Ser Leu Phe Arg Gly Phe Ser Ala Pro Val Arg 
65 70 75 80 

Val Glu Thr Asp Leu Ser Asn Asp Asp Leu Phe Phe Leu Leu Ala His 
85 90 95 

Asp Ser Asp Glu Phe Asn Arg Trp Glu Ala Gly Gin Val Leu Ala Arg 
100 105 110 

Lys Leu Met Leu Asn Leu Val Ser Asp Phe Gin Gin Asn Lys Pro Leu 
115 120 125 

Ala Leu Asn Pro Lys Phe Val Gin Gly Leu Gly Ser Val Leu Ser Asp 
130 135 140 
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Ser Ser Leu Asp Lys Glu Phe lie Ala Lys Ala lie Thr Leu Pro Gly 
145 150 155 160 

Glu Gly Glu He Met Asp Met Met Ala Val Ala Asp Pro Asp Ala Val 
165 170 175 

His Ala Val Arg Lys Phe Val Arg Lys Gin Leu Ala Ser Glu Leu Lys 
180 185 190 

Glu Glu Leu Lys He Val Glu Asn Asn Arg Ser Thr Glu Ala Tyr Val 
195 200 205 

Phe Asp His Ser Asn Met Ala Arg Arg Ala Leu Lys Asn Thr Ala Leu 
210 215 220 

Ala Tyr Leu Ala Ser Leu Glu Asp Pro Ala Tyr Met Gly Thr Cys Thr 
225 230 235 240 

Glu Arg He Gin Gly Gly His Gin Phe Asp Arg Pro He Cys Cys Phe 
245 250 255 

Gly Thr Leu Ser Gin Asn Pro Gly Lys Thr Arg Glu Arg Thr Phe Leu 
260 265 270 

Pro Asp Phe Tyr Glu Gin Val Ala Gly Thr He 
275 280 

(2) INFORMATION FOR SEQ ID NO; 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to rnRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4A5 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ACCAGGAGGG GAAAAAGTCT TACCCCATGG ACATCCCGGG GATTGAGTGT TACCCGAAAA 60 
GGATGAAGAA TGGTATTCCT CCGTCGTGGA CCCCATGCAC CCATIGGGAA AGCCGTGTGG 120 

ccrrrrcrrr cagggatgat agaaaagtgc tcccttggga tggaaaggag gagccitiac iso 

TGGTAGTGGC CGATAGGGTG AQGAATGTTG TGGAGGCTGA TGAOGGGTAT TATCTCGIGG 240 

TGGCTGAGAA CGGACITAAG CIAGAGAAAG GATCAGATTT GAAGGOGAGA GAGGTGAAGG 300 

AGAGITTAGG GATGGTTGTT TIGGIGGIGA GGCOGCCAAG AGAAGATGAT GATGATIGGC 360 

AGACAAGTCA TCAGAACTGG GACTGAATTA ATAGAATCAA TACTCATATG CTGTAACIGA 420 

TTACGGAGTC ATCATGGICA TGTAAAATTT TTGGATAAAG GIGGTAACTT TTTGITCIAA 480 

GATACAATCA GAAACAGAGC AATATTTTTC TCTAAAAAAA AAAAAAAAAA AAAA 534 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Asp lie Pro Gly lie Glu Cys Tyr Pro Lys Arg Met Lys Asn Gly 
15 10 15 

lie Pro Pro Ser Trp Tnr Pro Cys Thr His Trp Glu Ser Arg Val Ala 
20 25 30 
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Phe Ser Phe Arg Asp Asp Arg Lys Val Leu Pro Trp Asp Gly Lys Glu 
35 40 45 

Glu Pro Leu Leu Val Val Ala Asp Arg Val Arg Asn Val Val Glu Ala 
50 55 60 

Asp Asp Gly Tyr Tyr Leu Val Val Ala Glu Asn Gly Leu Lys Leu Glu 
65 70 75 80 

Lys Gly Ser Asp Leu Lys Ala Arg Glu Val Lys Glu Ser Leu Gly Met 
85 90 95 

Val Val Leu Val Val Arg Pro Pro Arg Glu Asp Asp Asp Asp Trp Gin 
100 105 110 

Thr Ser His Gin Asn Trp Asp 
115 

(2) INFORMATION FDR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer V6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATGCITTGCA TAACITTGAG G 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer T7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AATAOGACIC ACTATAG 



WO 00/24914 PCT/EP99/07972 

-1 - 

SEQUENCE LISTING 

(1) GENERAL INFORMATICS: 

(i) APPLICANT: 

(A) NAME: NOVARTIS AG 

(B) STREET: Schwarzwaldallee 215 

(C) CITY: Basel 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP) : 4058 

(G) TELEPHONE: +41 61 324 11 11 

(H) TELEFAX: + 41 61 322 75 32 

(ii) TITLE OF INVENTION: Organic Coirpounds 
(iii) NUMBER OF SEQUENCES: 18 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0 , Version #1.25 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECHESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cENA to inRNA 
(iii) HYPOTHETICAL: NO , 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3A35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ACGTGTCCGT GGAGGCGGGT OGGGTCAGTC GGGTCAGATA CCAAGGTGCC AAGTGGAAGG 60 

TTCTCGGATG GATCTAACCA ATGCAAAAGG TTATTACTOG AGACACCGAG TTIGTGGAGT 120 

GCACTCTAAA ACACCTAAAG TCACTGTGGC TGGTATOGAA CAGAGGTTTT GTCAACAGTG 180 

CAGCAGGTTr CATCAGCTTC CGGAATTTGA CCTAGAGAAA AGGAGTTGCC GCAGGAGACT 240 
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CGCTGGTCAT AATGAGCGAC GAAGGAAGCC ACAGCCTGCG TCTCTCTC7IG TGTTAGCTTC 300 

TCGTTACGGG AGGATCGCAC CTTCGCTTTA CGAAAATGGT GATGCTQC5AA TCAATGGAAG 360 

CTTIUTTGGG AACCAAGAGA TAGGATGGCC AAGTTCAAGA ACATTGGATA CAAGAGTCAT 420 

GAGGCGGCCA GTGTCATCAC CGTCATCGCA GATCAATCCA ATGAATGTAT TTAGTCAAGG 480 

TTCAGTTGGT GGAGGAAGGA CAAGCTTCTC ATCTCCAGAG ATTATGGACA CTAAACTAGA 540 

GAGCTACAAG G 551 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTEKISTICS : 

(A) LENGTH: 375 amino acids 

(B) TYPE: amino acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
<iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3 A3 5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Glu Met Gly Ser Asn Ser v Gly Pro Gly His Gly Pro Gly Gin Ala 
1 5 10 15 

Glu Ser Gly Gly Ser Ser Thr Glu Ser Ser Ser Phe Ser Gly Gly Leu 
20 25 30 

Met Phe Gly Gin Lys He Tyr Phe Glu Asp Gly Gly Gly Gly Ser Gly 
35 40 45 

Ser Ser Ser Ser Gly Gly Arg Ser Asn Arg Arg Val Arg Gly Gly Gly 
50 55 60 

Ser Gly Gin Ser Gly Gin He Pro Arg Cys Gin Val Glu Gly Cys Gly 
65 70 75 80 

Met Asp Leu Thr Asn Ala Lys Gly Tyr Tyr Ser Arg His Arg Val Cys 
85 90 95 

Gly Val His Ser Lys Thr Pro Lys Val Thr Val Ala Gly He Glu Gin 
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100 



105 



110 



Arg Phe Cys Gin Gin Cys Ser Arg Phe His Gin Leu Pro Glu Phe Asp 
115 120 125 

Leu Glu Lys Arg Ser Cys Arg Arg Arg Leu Ala Gly His Asn Glu Arg 
130 135 140 

Arg Arg Lys Pro Gin Pro Ala Ser Leu Ser Val Leu Ala Ser Arg Tyr 
145 150 155 160 

Gly Arg lie Ala Pro Ser Leu Tyr Glu Asn Gly Asp Ala Gly Met Asn 
165 170 175 

Gly Ser Phe Leu Gly Asn Gin Glu lie Gly Trp Pro Ser Ser Arg Thr 
180 185 190 

Leu Asp Thr Arg Val Met Arg Arg Pro Val Ser Ser Pro Ser Trp Gin 
195 200 205 

lie Asn Pro Met Asn Val Phe Ser Gin Gly Ser Val Gly Gly Gly Arg 
210 215 220 

Thr Ser Phe Ser Ser Pro Glu lie Met Asp Thr Lys Leu Glu Ser Tyr 
225 230 235 240 

Lys Gly lie Gly Asp Ser Asn Cys Ala Leu Ser Leu Leu Ser Asn Pro 
245 250 255 

His Gin Pro His Asp Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn 
260 265 270 

Asn Asn Asn Thr Trp Arg Ala Ser Ser Gly Phe Gly Pro Met Thr Val 
275 280 285 

Thr Met Ala Gin Pro Pro Pro Ala Pro Ser Gin His Gin Tyr Leu Asn 
290 295 ' 300 

Pro Pro Trp Val Phe Lys Asp Asn Asp Asn Asp Met Ser Pro Val Leu 
305 310 315 320 

Asn Leu Gly Arg Tyr Thr Glu Pro Asp Asn Cys Gin lie Ser Ser Gly 
325 330 335 

Thr Ala Met Gly Glu Phe Glu Leu Ser Asp His His His Gin Ser Arg 
340 345 350 

Arg Gin Tyr Met Glu Asp Glu Asn Thr Arg Ala Tyr Asp Ser Ser Ser 
355 360 365 



His His Thr Asn Trp Ser Leu 
370 375 



(2) INFORMATION FOR SEQ ID NO: 3: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 859 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECJvJESS: double 
<D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: cENA to mRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 7 

(A) ORGANISM: Arabidopsis thai i ana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3B39 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TCAACATTGC TTCCTAACCA GAAATCCACC ATC^TCTTCC CACGAATACA ACTTAAAGCT 60 

TTACCAGAAA ATGGAGGGTC AGAGAACACA ACGCCGGGGT TACTTGAAAG ACAAGGCTAC 120 

AGTCTCCAAC CTTGTTGAAG AAGAAATGGA GAATGGCATG GATGGAGAAG AGGAGGATGG 180 

AGGAGACGAA GACAAAAGGA AGAAGGTGAT GGAAAGAGTT AGAGGTCCTA GCACTGACCG 240 

TGTOCCATCG CGACTGTGCC AGGTCGATAG GTGCACTGTT AATTTGACTG AGGCCAAGCA 300 

GTATTACCGC AGACACAGAG TATGTGAAGT ACATGCAAAG GCATCTCCTG CGACTGTTGC 360 

AGGGGTCAGG CAACGCTTTT GTCAACAATG CAGCAGGTIT CATGAGCTAC CAGAGTITGA 420 

TGAAGCTAAA AGAAGCTGCA GGAGGCGCTT AGCTGGACAC AATGAGAGGA GGAGGAAGAT 480 

CICTGGTGAC AGTTTTGGAG AAGGGTCAGG CCGGAGAGGG TTTAGCGGTC AACTGATCCA 540 

GACTCAAGAA AGAAACAGGG TAGACAGGAA ACTTCCTATG ACCAACICAT CATTTAAGGG 600 

ACCACAGATC AGATAAACCC TCXXGCTCTC TCTCITCTGT CATCTACATA TCCTCTATCT 660 

AC^CTCTTAT TAGACAAATA ATGGCATCTA ACAATGTCAA GAAAAGTTGG TCATCGTATT 720 

AAATCCTAGA GGGAAATATA AGTATAAACC TITAGTCCCC TTTATGCTGT CCTGTAATGA 780 

ATATCTATCC GGAAATGTAT TCGCATAGTC TTGCGTCTAA TAATCITTAT TAAAAAAAAA 840 

AAAAAAAAAA AAAAAAAAA 859 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE OIARACTERISTICS: 

(A) LENGTH: 181 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEEKESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3B39 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Glu Gly Gin Arg Thr Gin Arg Arg Gly Tyr Leu Lys Asp Lys Ala 
15 10 15 

Thr Val Ser Asn Leu Val Glu Glu Glu Met Glu Asn Gly Met Asp Gly 
20 25 30 

Glu Glu Glu Asp Gly Gly Asp Glu Asp Lys Arg Lys Lys Val Met Glu 
35 40 45 

Arg Val Arg Gly Pro Ser Thr Asp Arg Val Pro Ser Arg Leu Cys Gin 
50 55 60 

Val Asp Arg Cys Thr Val Asn Leu Thr Glu Ala Lys Gin Tyr Tyr Arg 
65 70 75 80 

Arg His Arg Val Cys Glu Val His Ala Lys Ala Ser Ala Ala Thr Val 
85 90 95 

Ala Gly Val Arg Gin Arg Phe Cys Gin Gin Cys Ser Arg Phe His Glu 
100 105 110 

Leu Pro Glu Phe Asp Glu Ala Lys Arg Ser Cys Arg Arg Arg Leu Ala 
115 120 125 

Gly His Asn Glu Arg Arg Arg Lys He Ser Gly Asp Ser Phe Gly Glu 
130 135 140 

Gly Ser Gly Arg Arg Gly Phe Ser Gly Gin Leu He Gin Thr Gin Glu 
145 150 155 160 

Arg Asn Arg Val Asp Arg Lys Leu Pro Met Thr Asn Ser Ser Phe Lys 
165 170 175 

Gly Pro Gin He Arg 
180 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEC3SIESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cWA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) AKTTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis tha liana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4B19 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

AGAAGCAGAA AGGTAAAGCT ACAAGTAGTA GTGGAGTTTG TCAGGTCGAG AGTTGTACCG 60 

CGGATATGAG CAAAGCCAAA CAGTACCACA AACGACACAA AGTOTGCCAG TTTCATGCCA 120 

AAGCTCTTCA TGTTCGGATC TCTGGTCTTC ACCAACGTTT CIX3CCAACAA TGCAGCAGGT 180 

TTCACGCGCT CAGTGAGTTT GATGAAGCCA AGCGGAGTPG CAGGAGACGC TTAGCTGGAC 240 

ACAACGAGAG AAGGCGGAAA AGCACAACTG ACTAAAGACG GTGAAACGTG TGAGATCCCG 300 

GTTTGAAGGT TAATGAAACA GGCITTCCTT ACTCTCTTCT GTCAGTCTCT TTTAGCTCCT 360 

TGTAATCCTC TGTGTCTCTG TCTGTTTCTC CATATTACCT GTAATCAAAG CTATCTGCTA 420 

AACCTACGAC ATGGTTAAAT AAATGCATTC AGACTTAAAA AAAAAAAA&A AAAAAAAAA 479 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEDX3TH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MDLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 
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(vii) IMMEDIATE SOURCE: 
(B) CLOSE: 4B19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 6: 

Met Ser Met Arg Arg Ser Lys Ala Glu Gly Lys Arcr Ser Leu Arg Glu 
15 10 15 

Leu Ser Glu Glu Glu Glu Glu Glu Glu Glu Thr Glu Asp Glu Asp Thr 
20 , 25 30 

Phe Glu Glu Glu Glu Ala Leu Glu Lys Lys Gin Lys Gly Lys Ala Thr 
35 40 45 

Ser Ser Ser Gly Val Cys Gin Val Glu Ser Cys Thr- Ala Asp Met Ser 
50 55 60 

Lys Ala Lys Gin Tyr His Lys Arg His Lys Val Cys Gin Phe His Ala 
65 70 75 80 

Lys Ala Pro His Val Arg He Ser Gly Leu His Gin Arg Phe Cys Gin 
85 90 95 

Gin Cys Ser Arg Phe His Ala Leu Ser Glu Phe Asp Glu Ala Lys Arg 
100 105 no 

Ser Cys Arg Arg Arg Leu Ala Gly His Asn Glu Arg Arg Arg Lys Ser 
115 120 125 

Thr Thr Asp 
130 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LEKK3TH; 2682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEC^ESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cWA to iriRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3A52 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GCCATTCAAG GAGACACTAA TGGTGCTCTT ACTTTCAATC TTAATGGTGA AAGTGATGGC 60 

CTTITTCCTG CCAAGAAGAC CAAATCCGGA GCCGTTT3TC AGGTCGAAAA CTCTGAAGCT 120 

GATCTEAGTA AAGTTAAGGA TTATCATAGA CGCCATAAGG TCTGTSAGAT GCATTCCAAG 180 

GCTACTAGTG CCACTGTOGG AGGTATCTTG CAGCGCTTTT GTCAGCAATG TAGTAGGTIC 240 

CATCTTCTCC CAGGTTTOGA TCACGGAAAG AGAAGTTGTC GTAGACGTTT GGCIGGCCAT 300 

AATAAACGTC CGAGGAAAAC AAATCCCGAA CCTX3GCGCTA ACGGGAATCC TAGTGATGAT 360 

CACTCAAGCA ACTATCTCTT GATTACTCTC TTGAAGATAC TCTCCAATAT GCATAACCAT 420 

ACCGGTGATC AAGATITCAT GTCTCATCTT CTGAAGAGCC TCGTAAGCCA TGCTGGCGAA 480 

CAGTTAGGGA AAAACTTAGT TGAACTTCTT CTACAAGGAG AGATCTCAAG GTTCCTTAAA 540 

ATATTGGAAA ACT03GCTTT GCTTGGGATT GAGCAAGCTC CTCAAGAGGA GTTAAAGCAA 600 

TTTrOGGCIC ggcaagatgg GACAGCTACC GAGAACAGAT CAGAAAAACA AGTCAAAATC 660 

AATGATTTTG ATTTGAATGA TATCTATATA GACTCAGATG ACACAGAOGT CGAAAGATCT 720 

CCTCCTCCAA CGAATCCAGC GACCAGTTCT CTTGATTATC CTTCATCGAT ACATCAGICT 780 

AG?IXXGCCTC AGACAAGTAG GAATTCAGAT TCAGCATCTG ACCAGTCACC CTCAAGTTCT 840 

AGTGAAGATG CICAGATGCG CACAGGCCGG ATIGTGTTCA AACTATITCG GAAAGAGOCA 900 

AATGAATTTC OTATTGTCTT AOGAGGACAG ATTCTTGACT GGTTATCGCA TAGTCCAACT 960 

GACATGGAGA GCTACATAAG ACCTGGCTCT ATCGTATTGA CCATCTATCT TCGTCAAGCT 1020 

GAAACTGCTT GGGAAGAACT TTCAGACGAT CTGGGTrTTA GCTTAGGGAA GCTTCTAGAT 1080 

CTCTCOGATG ATCXTCTTGTG GACAACTGGA TGGATTTATC TAGGGTGCAG AACCAACTTG 1140 

CATTK7TATA TAACGGTCAG GTTGTCGTK3 ACACTTCATT GTCTCTAAAA AGTCGTGATT 1200 

ATAGTCACAT CATTAGCGTT AAACCGCITC CTATAGCTGC AACGGAGAAG GCTCAATTTA 1260 

CAGTTAAAGG CATGAATCTC CGTCGGCGTG GCACAAGGTT ACTTTGTTCT GTTGAAGGAA 1320 

AATACTTGAT TCAGGAAACA ACACACGATT OGACX3ACCAG GGAGGATCAC GATTTCAAGG 1380 

ACAACAGTGA GATTGTTGAG TGTGTAAACT TCTCTTCTCA TATCCCTATA TTGAGTGGTC 1440 

GAGGATTCAT GGAGATTGAA GACCAAGGAC TCAGTAGCAG CTTCTTCGCT TTCTTAGTCG 1500 

TTCAAGATGA CGATGTTTCT TCTGAAATCC GTATACTTGA AACCACATTA GAGTTCACTG 1560 

GAACTGATrC TGCTAAGCAA GCTATGGATT TCATACATGA AATCGGTTGG CITCITCACA 1620 
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GAAGTAAACT 


TGGQGAATCA GACCCAAATC CAGGCGTTIT CCCATTAATA OXTTCCAGT 


1680 


GGCTAATCGA 


GTTCTCAATG GATCGAGAGT GGTGCGCTGT G&TCAGAAAG CTATTAAACA 


1740 


TCTTCTTTGA 


TGGAGCl^lT GGTGAATTTT CTTCCICCTC TAATGCCACA CTGTCAGAAC 


1800 


TCTGCCTTCT 


TCACAGAGCC GTGAGGAAAA ACTCTAAGCC TATGGTTCSAA ATGCTCTTGA 


1860 


GATATATTCC 


CAAGCAACAG AGAAACAGCT TGTTTAGACC (XATGCTGCT GGTCCAGCCG 


1920 


GCTTAACACC 


TCTTCATATT GCAGCTGGTA AAGACGGTTC AGAAGATGTC TTGGATGOGC 


1980 


TAACAGAAGA 


TCCTGCAATG GTGGGGATTC AAGCGTGGAA GAGATCTCX5A GACAGCACAG 


2040 


GCTTCACACC 


AGAAGACTAC GCACGCTTAC GCGGTCACTT CTCATACATC CACTTCATTC 


2100 


AACGCAAGAT 


CAATAAAAAG TCAACAACTG AAGATCATGT TGTCGTCAAC ATCCCAGrTTT 


2160 


crrrcTCAGA 


CAGAGAGCAG AAAGAACCAA AATCAGGTCC GATGGCTTCA GCCTTGGAGA 


2220 


TCACACAGAT 


TCCATCCAAG CTCTGTGACC ATAAACTGGT GTATGGGACA ACACGCAGGT 


2280 


CTGTAGCGTA 


CAGACCAGCT ATGTTCTCAA TGGTGGQGAT TGCTCCGGITT TGCGTCTGIG 


2340 


TGGCACTTCT 


GTTTAAGAGT TGCCCGGAAG TCCTCTATCT GTTTCAACCG TTCAGGTGGG 


2400 


AGTTATTGGA 


CTATGGAACA AGCTGAGTGT AAGTCTACTT TGAAAGA3 1 CT TCTAAGATAT 


2460 


ATATATGAAT 


GTTACTTATA TAAAACCATA GAGGTCTGAT TTCTATAIGT AACTATATGA 


2520 


GTATAAGATA 


TAGAGACATG TTGGAGAAGA AGATICTTGT TATTATTG!TT CTTO11X3ITC 


2580 


TTGuDGTAAAA 


GCCTCIOCTA TCTCTCTCGA ACCTAAGGAT TCTCTCTCTG ATTAGTATAT 


2640 




ACAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 


2682 


(2) INFORMATICS FOR SBQ ID NO: 8: 





(i) SEQUENCE CHARACTERISTICS: 

(A) LE3NK3IH: 848 amino acids 

(B) TYPE; amino acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 
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(B) CLOSE: : 3A52 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Glu Ala Arg lie Asp Glu Gly Gly Glu Ala Gin Gin Phe Tyr Gly 
15 10 15 

Ser Val Gly Asn Ser Ser Asn Ser Ser Ser Ser Cys Ser Asp Glu Gly 
20 25 30 

Asn Asp Lys Lys Arg Arg Ala Val Ala lie Gin Gly Asp Thr Asn Gly 
35 40 45 

Ala Leu Thr Leu Asn Leu Asn Gly Glu Ser Asp Gly Leu Phe Pro Ala 
50 55 60 

Lys Lys Thr Lys Ser Gly Ala Val Cys Gin Val Glu Asn Cys Glu Ala 
65 70 75 80 

Asp Leu Ser Lys Val Lys Asp Tyr His Arg Arg His Lys Val Cys Glu 
85 90 95 

Met His Ser Lys Ala Thr Ser Ala Thr Val Gly Gly lie Leu Gin Arg 
100 105 110 

Phe Cys Gin Gin Cys Ser Arg Phe His Leu Leu Pro Gly Phe Asp Asp 
115 120 125 

Gly Lys Arg Ser Cys Arg Arg Arg Leu Ala Gly His Asn Lys Arg Pro 
130 135 140 

Arg Lys Thr Asn Pro Glu Pro Gly Ala Asn Gly Asn Pro Ser Asp Asp 
145 150 155 160 

His Ser Ser Asn Tyr Leu Leu lie Thr Leu Leu Lys lie Leu Ser Asn 
165 170 175 

Met His Asn His Thr Gly Asp Gin Asp Leu Met Ser His Leu Leu Lys 
180 185 190 

Ser Leu Val Ser His Ala Gly Glu Gin Leu Gly Lys Asn Leu Val Glu 
195 200 205 

Leu Leu Leu Gin Gly Arg Arg Ser Gin Gly Ser Leu Asn He Gly Asn 
210 215 220 

Ser Ala Leu Leu Gly He Glu Gin Ala Pro Gin Glu Glu Leu Lys Gin 
225 230 235 240 

Phe Ser Ala Arg Gin Asp Gly Thr Ala Thr Glu Asn Arg Ser Glu Lys 
245 250 255 

Gin Val Lys Met Asn Asp Phe Asp Leu Asn Asp He Tyr He Asp Ser 
260 265 270 
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Asp Asp Thr Asp Val Glu Arg Ser Pro Pro Pro Thr Asn Pro Ala Thr 
275 280 285 

Ser Ser Leu Asp Tyr Pro Ser Trp He His Gin Ser Ser Pro Pro Gin 
290 295 300 

Thr Ser Arg Asn Ser Asp Ser Ala Ser Asp Gin Ser Pro Ser Ser Ser 
305 310 315 320 

Ser Glu Asp Ala Gin Met Arg Thr Gly Arg He Val Phe Lys Leu Phe 
325 330 335 

Gly Lys Glu Pro Asn Glu Phe Pro He Val Leu Arg Gly Gin He Leu 
340 345 350 

Asp Trp Leu Ser His Ser Pro Thr Asp Met Glu Ser Tyr He Arg Pro 
355 360 365 

Gly Cys He Val Leu Thr He Tyr Leu Arg Gin Ala Glu Thr Ala Trp 
370 375 380 

Glu Glu Leu Ser Asp Asp Leu Gly Phe Ser Leu Gly Lys Leu Leu Asp 
385 390 395 400 

Leu Ser Asp Asp Pro Leu Trp Thr Thr Gly Trp He Tyr Val Arg Val 
405 410 415 

Gin Asn Gin Leu Ala Phe Val Tyr Asn Gly Gin Val Val Val Asp Thr 
420 425 430 

Ser Leu Ser Leu Lys Ser Arg Asp Tyr Ser His He He Ser Val Lys 
435 440 445 



Pro Leu Ala He Ala Ala Thr Glu Lys Ala Gin Phe Thr Val Lys Gly 
450 455 v 460 

Met Asn Leu Arg Arg Arg Gly Thr Arg Leu Leu Cys Ser Val Glu Gly 
465 470 475 480 

Lys Tyr Leu He Gin Glu Thr Thr His Asp Ser Thr Thr Arg Glu Asp 
485 490 495 

Asp Asp Phe Lys Asp Asn Ser Glu He Val Glu Cys Val Asn Phe Ser 
500 505 510 

Cys Asp Met Pro He Leu Ser Gly Arg Gly Phe Met Glu He Glu Asp 
515 520 525 

Gin Gly Leu Ser Ser Ser Phe Phe Pro Phe Leu Val Val Glu Asp Asp 
530 535 5 4o 

Asp Val Cys Ser Glu He Arg He Leu Glu Thr Thr Leu Glu Phe Thr 
545 550 555 560 
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Gly Thr Asp Ser Ala Lys Gin Ala Met Asp Phe lie His Glu He Gly 
565 570 575 

Trp Leu Leu His Arg Ser Lys Leu Gly Glu Ser Asp Pro Asn Pro Gly 
580 585 590 

Val Phe Pro Leu He Arg Phe Gin Trp Leu He Glu Phe Ser Met Asp 
595 600 605 

Arg Glu Trp Cys Ala Val He Arg Lys Leu Leu Asn Met Phe Phe Asp 
610 615 620 

Gly Ala Val Gly Glu Phe Ser Ser Ser Ser Asn Ala Thr Leu Ser Glu 
625 630 635 640 

Leu Cys Leu Leu His Arg Ala Val Arg Lys Asn Ser Lys Pro Met Val 
645 650 655 

Glu Met Leu Leu Arg Tyr He Pro Lys Gin Gin Arg Asn Ser Leu Phe 
660 665 670 

Arg Pro Asp Ala Ala Gly Pro Ala Gly Leu Thr Pro Leu His He Ala 
675 680 685 

Ala Gly Lys Asp Gly Ser Glu Asp Val Leu Asp Ala Leu Thr Glu Asp 
690 695 700 

Pro Ala Met Val Gly He Glu Ala Trp Lys Thr Cys Arg Asp Ser Thr 
705 710 715 720 

Gly Phe Thr Pro Glu Asp Tyr Ala Arg Leu Arg Gly His Phe Ser Tyr 
725 730 735 

He His Leu He Gin Arg Lys He Asn Lys Lys Ser Thr Thr Glu Asp 
740 745 750 

His Val Val Val Asn He Pro Val Ser Phe Ser Asp Arg Glu Gin Lys 
755 760 765 

Glu Pro Lys Ser Gly Pro Met Ala Ser Ala Leu Glu He Thr Gin He 
770 775 780 



Pro Cys Lys Leu Cys Asp His Lys Leu Val Tyr Gly Thr Thr Arc? Ara 
785 790 795 800 

Ser Val Ala Tyr Arg Pro Ala Met Leu Ser Met Val Ala He Ala Ala 
805 810 815 

Val Cys Val Cys Val Ala Leu Leu Phe Lys Ser Cys Pro Glu Val Leu 
820 825 830 

Tyr Val Phe Gin Pro Phe Arg Trp Glu Leu Leu Asp Tyr Gly Thr Ser 
835 840 845 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LEH3TH: 576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to iriRNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 
<B) CLONE: 4B11 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CAGCGGAAGA GCTCACCGTT GAAGAGAGGA ATCTCCTCTC TGTTGCTTAC AAAAACGTGA 60 

TCGG&TCTCT ACGCGCCGCC TGGAGGATCG TGTCTTCGAT TGAGCAGAi\G GAAGAGAGTA 120 

GGAAGAACGA CGAGCACGTG TCGCTTGTCA AGGATTACAG ATCTAAAGTT GAGTCTGAGC 180 

TITCTTCTGT TTGCTCTGGA ATCCTTAAGC TCCTTGACTC GCATCTGATC CCATCTGCTG 240 

GAGCGAGTCA GTCTAAGGTC TTTTACTTGA AGATGAAAGG TGATTATCAT CGGTACATGG 300 

CTGAGTTTAA GTCTGGTGAT GAGAGGAAAA CTGCTGCTGA AGATACCATC CTCGCITACA 360 

AAGCAGCTCA GGATATCGCA GCTGOGGATA TGGCACCTAC TCATCCGATA AGGCTTGGTC 420 

TGGCCCTGAA TTBCIX^GIG TICTACTATG AGATTCTCAA TTCTTCAGAC AAAGCTTCTA 480 

ACATGGCCAA ACAGGC1T1T GAGGAAGCCA TAGCTGAGCT TGACACTCTG GGAGAAGAAT 540 

CCTACAAAGA GAGCACTCTC ATAATGCAGT TGOTGA 576 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
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(iii) AOTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE; 
<B) CLONE: 4B11 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Ala Thr Leu Gly Arg Asp Gin Tyr Val Tyr Met Ala Lys Leu 
1 5 10 15 

Ala Glu Gin Ala Glu Arg Tyr Glu Glu Met Val Gin Phe Met Glu Gin 
20 25 30 

Leu Val Thr Gly Ala Thr Pro Ala Glu Glu Leu Thr Val Glu Glu Arg 
35 40 45 

Asn Leu Leu Ser Val Ala Tyr Lys Asn Val lie Gly Ser Leu Arg Ala 
50 55 60 

Ala Trp Arg lie Val Ser Ser lie Glu Gin Lys Glu Glu Ser Arg Lys 
65 70 75 80 

Asn Asp Glu His Val Ser Leu Val Lys Asp Tyr Arg Ser Lys Val Glu 
85 90 95 

Ser Glu Leu Ser Ser Val Cys Ser Gly lie Leu Lys Leu Leu Asp Ser 
100 105 110 

His Leu He Pro Ser Ala Gly Ala Ser Glu Ser Lys Val Phe Tyr Leu 
115 120 125 

Lys Met Lys Gly Asp Tyr His Arg Tyr Met Ala Glu Phe Lys Ser Gly 
130 135 140 

Asp Glu Arg Lys Thr Ala Ala Glu Asp Thr Met Leu Ala Tyr Lys Ala 
145 150 155 160 

Ala Gin Asp He Ala Ala Ala Asp Met Ala Pro Thr His Pro He Arg 
165 170 175 

Leu Gly Leu Ala Leu Asn Phe Ser Val Phe Tyr Tyr Glu He Leu Asn 
180 185 190 

Ser Ser Asp Lys Ala Cys Asn Met Ala Lys Gin Ala Phe Glu Glu Ala 
195 200 205 

He Ala Glu Leu Asp Thr Leu Gly Glu Glu Ser Tyr Lys Asp Ser Thr 
210 215 220 

Leu He Met Gin Leu Leu Arg Asp Asn Leu Thr Leu Trp Thr Ser Asp 
225 230 235 240 
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Met Gin Glu Gin Met Asp Glu Ala 
245 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LEW3TH: 659 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

/ 

(ii) MOLECULE TYPE: cDNA to iriRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4A24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CGCCGCCACC GCGATGTAOG TGATCTACCA CCCTOTTCCG CCGTCGT^PCT CX^GTCXXCTC 60 

AATAAGAATC AGCCGCGTGA ACCTAACAAC CTCCTCIGAT TCXJTCCCTCT CTCATCTOTC 120 

TTCCTTCTTC AACTTCACTC TAATCTCAGA GAATCCAAAC CAACACCrCT CTITCTCTTA 180 

OSATCXTTTC ACCGTCACCG TTAATTCAGC TAAATCCGGT ACGATGCTCG GTAACGGAAC 240 

TGTTCCTCCT TTCTTCAGCG ATAAOGCTAA CAAAACTTCG TTTCACGC5CG TGATCGCTAC 300 

GTCTACAGCG GCGOGTGAGT TAGATCOGGA TGAAGCTAAG CATCTGAC5AT CAGATCTGAC 360 

GCGCGOGCGT GTAGGATATG AGATCGAGAT GAGAACTAAA GTGAAGATGA TAATGGGGAA 420 

GCTGAAGAGT GAAGGAGTAG AGATCAAAGT GACATGTTGA AGGAITTGAA GGAACTATAC 480 

CAAAAGGTAA AACTCCAATT GTAGCTACIT CTAAAAAAAC TAAGTGTAAG TCIGATCTTA 540 

GTGTCAAGTC TGGAAATQGA TTTCTAAAGG AATTTGATAA TITCACAITG AAATTCTATA 600 

TATCTCTCTT TTTCTCTGGA TTTGTGAAAC TTTGGATGAT CAAAGAATTC TTCATTGTC 659 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LEEK7IH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDECNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLEOJLE TYPE: protein 
(iii) HYPOTHETICAL: 1SJO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: , 

(B) CLONE: 4A24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Arg lie Cys Cys Cys Cys Phe Trp Ser lie Leu lie lie Leu lie Leu 
15 10 15 

Ala Leu Met Thr Ala lie Ala Ala Thr Ala Met Tyr Val lie Tyr His 
20 25 30 

Pro Arg Pro Pro Ser Phe Ser Val Pro Ser lie Arg lie Ser Arg Val 
35 40 45 

Asn Leu Thr Thr Ser Ser Asp Ser Ser Val Ser His Leu Ser Ser Phe 
50 55 60 

Phe Asn Phe Thr Leu lie Ser Glu Asn Pro Asn Gin. His Leu Ser Phe 
65 70 75 80 

Ser Tyr Asp Pro Phe Thr Val Thr Val Asn Ser Ala Lys Ser Gly Thr 
85 90 95 

Met Leu Gly Asn Gly Thr Val fro Ala Phe Phe Ser Asp Asn Gly Asn 
100 105 110 

Lys Thr Ser Phe His Gly Val lie Ala Thr Ser Thr Ala Ala Arg Glu 
115 120 125 

Leu Asp Pro Asp Glu Ala Lys His Leu Arg Ser Asp Leu Thr Arg Ala 
130 135 140 

Arg Val Gly Tyr Glu He Glu Met Arg Thr Lys Val Lys Met He Met 
145 150 155 160 

Gly Lys Leu Lys Ser Glu Gly Val Glu He Lys Val Thr Cys 
165 170 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 584 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEEHESS : double 

(D) TOPOLOGY; linear 

(ii) MOLEHJLE TYPE: cESNIA to iriRNA 
(iii) HYPOTHETICAL, : NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3B76 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CCTCCAACTC CAGGCCAGCC AACAAAAGAA CCTACATTTA TTCCAGTGGT TGTTGGTCIT 60 

TTGGACTCAA GTGGGAAAGA CATTACTCTT TCCTCTGTTC ATTATGATCG TACAGTGCAG 120 

ACCAITTCAG GCAGCAGCAC AATACTTCGA GTGACAAGAA ACAAGAAGAG TTTGTGTTTT 180 

CTGATATACC AGAAAGACCT GTTCX^GTCCC TATTTAGGGG ATICAGCCCC AGTICGTGfIT 240 

GAAACTGATC TCTCTAATGA TGACTTATTC TICCTCCTAG CACATGATPC AGATGAATIC 300 

AATAGGTGGG AGGCCGGTCA AGTTCTGGCA AGAAAGCTGA TGCTGAACPT AGTTTCTGAT 360 

TTCCAGCAAA ATAAACCGTT GGCTCTAAAC CCAAAATTTG TGCAAGCTCT CGGCAGTGTG 420 

CTTTCTGACT CAAGCITGGA CAAGGAATTT ATAGCCAAAG CAATAACACT ACCTGGGGAG 480 

GGAGAGATAA TGGACATGAT GGCCXJTGGCG GATCCTGATG CTCITCATGC TGTTAGAAAG 540 

ITTGTACGAA AGCAGCTTGC ATCTGAACTT AAGGAGGAGC TTCT 584 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LEN3TH: 283 amino acids 

(B) TYPE: amino acid 

(C) STRANDETXJESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: 3B76 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Pro Pro Thr Pro Gly Gin Pro Thr Lys Glu Pro Thr Phe He Pro Val 
15 10 15 

Val Val Gly Leu Leu Asp Ser Ser Gly Lys Asp He Thr Leu Ser Ser 
20 25 30 

/ 

Val His Tyr Asp Gly Thr Val Gin Thr He Thr Gly Ser Ser Thr He 
35 40 45 

Leu Arg Val Thr Lys Lys Gin Glu Glu Phe Val Phe Ser Asp He Pro 
50 55 60 

Glu Arg Pro Val Pro Ser Leu Phe Arg Gly Phe Ser Ala Pro Val Arg 
65 70 75 80 

Val Glu Thr Asp Leu Ser Asn Asp Asp Leu Phe Phe Leu Leu Ala His 
85 90 95 

Asp Ser Asp Glu Phe Asn Arg Trp Glu Ala Gly Gin Val Leu Ala Arg 
100 105 no 

Lys Leu Met Leu Asn Leu Val Ser Asp Phe Gin Gin Asn Lys Pro Leu 
115 120 125 

Ala Leu Asn Pro Lys Phe Val Gin Gly Leu Gly Ser Val Leu Ser Asp 
130 135 140 

Ser Ser Leu Asp Lys Glu Phe He Ala Lys Ala He Thr Leu Pro Gly 
i45 150 155 160 

Glu Gly Glu He Met Asp Met Met Ala Val Ala Asp Pro Asp Ala Val 
165 170 175 

His Ala Val Arg Lys Phe Val Arg Lys Gin Leu Ala Ser Glu Leu Lys 
180 185 190 

Glu Glu Leu Lys He Val Glu Asn Asn Arg Ser Thr Glu Ala Tyr Val 
195 200 205 

Phe Asp His Ser Asn Met Ala Arg Arg Ala Leu Lys Asn Thr Ala Leu 
210 215 220 

Ala Tyr Leu Ala Ser Leu Glu Asp Pro Ala Tyr Met Gly Thr Cys Thr 
225 230 235 240 

Glu Arg He Gin Gly Gly His Gin Phe Asp Arg Pro He Cys Cys Phe 
245 250 255 

Gly Thr Leu Ser Gin Asn Pro Gly Lys Thr Arg Glu Arg Thr Phe Leu 
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260 265 270 

Pro Asp Phe Tyr Glu Gin Val Ala Gly Thr He 
275 280 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE3NK3TH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDELHESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis t±Laliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ACCAGGAGGG GAAAAAGTCT TACCCCATGG ACATCCCGGG GATTGAGTGT TACCCGAAAA 60 

GGATGAAGAA TGGTATTCCT CCGTCGTGGA CCCCATGCAC CCATTGGGAA AGCCGTCTCG 120 

OJm'MTr CAGGGATGAT AGAAAAGTGC TCCCTTGGGA TGGAAAGGAG GAGCCTTTAC 180 

TGCTAGTGGC CGATAGGGTG AGGAATGTTC TGGAGGCTGA TGACGGGTAT TATCTCGTGG 240 

TGGCTGAGAA CGGACTTAAG CTAGAGAAAG ^TCAGATTT GAAGGCGAGA GAGGTGAAGG 300 

AGAGTTTAGG GATGGITGTT TTGGTGGTGA GGCCGCCAAG AGAAGATGAT GATGATTGGC 360 

AGACAAGTCA TCAGAACTGG GACTGAATTA ATAGAATCAA TACTCATATG CTGTAACTGA 420 

TTACGGAGTC ATCATGGTCA TGTAAAATTT TTGGATAAAG GTGGTAACTT TTIGTTCTAA 480 

GATACAATCA GAAACAGAGC AATATTTTTC TCTAAAAAAA AAAAAAAAAA AAAA 534 
(2) INFORMATIOSr FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEISK?IH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEOMESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thai i ana 

(vii) IMMEDIATE SOURCE: 
<B) CLONE: 4A5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Asp lie Pro Gly lie Glu Cys Tyr Pro Lys Arg Met Lys Asn Gly 
15 10 15 

lie Pro Pro Ser Trp Thr Pro Cys Thr His Trp Glu Ser Arg Val Ala 
20 25 30 

Phe Ser Phe Arg Asp Asp Arg Lys Val Leu Pro Trp Asp Gly Lys Glu 
35 40 45 

Glu Pro Leu Leu Val Val Ala Asp Arg Val Arg Asn Val Val Glu Ala 
50 55 60 

Asp Asp Gly Tyr Tyr Leu Val Val Ala Glu Asn Gly Leu Lys Leu Glu 
65 70 75 80 

Lys Gly Ser Asp Leu Lys Ala Arg Glu Val Lys Glu Ser Leu Gly Met 
85 90 95 

Val Val Leu Val Val Arg Pro Pro Arg Glu Asp Asp Asp Asp Trp Gin 
100 105 110 

Thr Ser His Gin Asn Trp Asp , 
115 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ENA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLC&E: primer V6 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATGCTTTCCA TAACTTTGAG G 21 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LEN3TH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEE3SIESS: jingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: HMA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer T7 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AATACGACTC ACTATAG 



17 



